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A  new  method  for  obtaining  optimized  parameters  for  semiempirical  methods  has  been  developed  and 
applied  to  the  modified  neglect  of  diatomic  overlap  (MNDO)  method.  The  method  uses  derivatives  of 
calculated  values  for  properties  with  respect  to  adjustable  parameters  to  obtain  the  optimized  values  of 
parameters.  The  large  increase  in  speed  is  a  result  of  using  a  simple  series  expression  for  calculated 
values  of  properties  rather  than  employing  full  semiempirical  calculations.  With  this  optimization  pro¬ 
cedure,  the  rate-determining  step  for  parameterizing  elements  changes  from  the  mechanics  of  parame¬ 
terization  to  the  assembling  of  experimental  reference  data. 


INTRODUCTION 

The  evolution  of  molecular  orbital  (MO) 
methods  has  resulted  in  two  main  methods, 
ab  initio  and  semiempirical.  Of  these,  ab 
initio,  having  no  need  for  empirically  deter¬ 
mined  parameters,  is  the  more  theoretically 
pure.  Such  methods,  however,  are  currently 
veiy  slow,  and  routine  application  at  any  rea¬ 
sonable  degree  of  accuracy  to  systems  of  even 
a  few  tens  of  atoms  is  still  not  practical.  In 
contrast,  semiempirical  methods  employ  em¬ 
pirically  determined  parameters  and  are 
thus  less  attractive  to  the  theoretical  purist. 
Unlike  ab  initio  methods,  the  accuracy  of  any 
empirical  method  is  limited  to  the  accuracy 
of  the  experimental  data  used  in  obtaining 
the  parameters.  However,  semiempirical 
methods  are  fast  enough  and  accurate 
enough  for  routine  application  to  quite  large 
systems.  In  consequence,  a  few  semiempiri¬ 
cal  methods,  in  particular  modified  neglect 
of  diatomic  overlap  (MNDO),‘  have  become 
very  popular.  To  a  large  degree,  this  is  a  re¬ 
sult  of  their  computational  simplicity  as  well 
as  their  chemically  useful  accuracy.^ 

The  power  of  semiempirical  methods  lies 
not  in  their  theoretical  rigor,  but  in  the  fact 
that  adjustable  parameters  within  the  meth¬ 
ods  are  optimized  to  reproduce  important 
chemical  properties.  A  comparison  of  ab  initio 
and  semiempirical  methods  has  shown  that, 
for  heats  of  formation  of  systems  related  to 


those  for  which  the  semiempirical  methods 
were  parameterized,  the  accuracy  of  semiem¬ 
pirical  methods  is  comparable  with  that  of 
quite  large  basis  set  ab  initio  calculations.^ 

Any  semiempirical  method  is  composed  of 
a  theoretical  framework  and  a  set  of  parame¬ 
ters.  Ideally,  these  parameters  should  be 
fully  optimized  using  an  overdetermined  ref¬ 
erence  set  of  molecules,  but  hitherto  this  has 
not  proven  possible.  In  consequence,  the  qual¬ 
ity  of  semiempirical  methods  has  not  been 
properly  tested.  It  is  impossible  to  partition 
errors  in  calculated  quantities  between  defi¬ 
ciencies  in  the  theory  and  incompleteness  of 
the  optimization  of  the  parameters.  Only 
when  the  parameters  for  a  semiempirical 
method  have  been  fully  optimized  can  the 
quality  of  the  theoretical  framework  be  de¬ 
termined.  In  the  limit,  when  fully  optimized 
parameters  are  used,  the  quality  of  a  method 
depends  solely  on  the  nature  of  the  approxi¬ 
mations  used,  and  more  sophisticated  meth¬ 
ods  will  be  the  more  accurate. 

This  is  exemplified  by  comparison  between 
MNDO  and  CNDO^  (complete  neglect  of  dif¬ 
ferential  overlap).  MNDO  is  a  more  sophis¬ 
ticated  method  than  CNDO,  taking  into 
account  lone-pair/lone-pair  repulsions;  there¬ 
fore,  a  properly  parameterized  MNDO  model 
should  perform  better  than  an  equivalent 
CNDO  model.  However,  until  the  parameters 
within  a  given  method  are  fully  optimized, 
such  a  "method”  must  be  regarded  as  the 
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combination  of  the  theoretical  framework 
and  the  current  set  of  parameters.  Once  the 
parameters  for  a  method  are  fully  optimized, 
then  reference  to  the  "method”  will  mean 
the  theoretical  framework  only. 


BACKGROUND  TO  PARAMETERIZATION 

Many  elements  have  been  optimized  in 
the  MNDO  method.  For  the  earlier  elements 
parameterized,  a  modified  form®  of  Bartels’® 
method  was  used.  This  general  function  opti¬ 
mization  procedure  did  not  make  use  of  any 
derivatives.  At  that  time  derivatives  methods 
were  not  practical;  "The  amount  of  computa¬ 
tion  time  required  would  become  excessive 
even  for  molecules  of  average  size.”^  In  con¬ 
sequence,  elements  were  parameterized  only 
with  great  difficulty.  Following  the  initial 
parameterization  of  four  elements  (hydrogen, 
carbon,  nitrogen,  and  oxygen),  several  other 
elements  were  parameterized  rapidly.  In 
these  later  parameterizations  the  values  of 
the  parameters  for  earlier  elements  were  kept 
constant.  A  result  of  this  serial  parameteri¬ 
zation  is  that  only  for  the  set  of  elements  first 
parameterized  could  the  parameters  be  fully 
optimized  and  internally  consistent.  Thus 
the  parameters  of  bromine’  were  optimized 
to  reproduce  bromine  chemistry  while  holding 
the  values  of  all  earlier  parameters  constant. 
The  operation  of  parameterizing  elements  is 
extremely  tedious  and  simultaneous  optimiza¬ 
tion  of  several  elements,  other  than  the  initial 
C — H — N — O  set,  has  not  been  attempted. 

Historically,  in  order  to  allow  facile  com¬ 
parison  of  results,  the  number  of  sets  of 
MNDO  parameters  has  been  kept  to  an  ab¬ 
solute  minimum.  Only  when  a  readily  iden¬ 
tifiable  and  significant  deficiency  has  been 
identified  and  corrected  has  a  new  set  of  pa¬ 
rameters  been  published.  The  set  of  parame¬ 
ters  reported  here  is  in  keeping  with  this 
philosophy  in  that  they  correct  various  errors 
which  are  present  when  the  basic  MNDO 
parameters  are  used. 

Optimization  of  parameters  has  proved  to 
be  a  difficult  operation  for  three  reasons. 
First,  there  has  often  been  a  paucity  of  accu¬ 
rate  experimental  data.  In  certain  cases, 
such  as  simple  gas-phase  organic  compounds 
in  their  ground  state,  good  experimental 
data  exist,®  but  the  necessary  accurate  ex¬ 


perimental  data  for  other  compounds  are 
frequently  absent.  Second,  optimization  of 
parameters  is  very  time  consuming,  involv¬ 
ing  complete  geometry  optimizations  for  a 
large  number  of  molecules  at  the  semiempiri- 
cal  level.  Third,  the  parameters  are  all  highly 
interdependent.  With  a  nearly  optimized  set 
of  parameters,  any  significant  change  in  any 
parameter  must  be  accompanied  by  a  corre¬ 
sponding  change  in  several  other  parame¬ 
ters.  Optimization  of  the  parameters  one  at 
a  time  would  be  prohibitively  slow.  This  in¬ 
terdependency  exists  not  only  within  the  pa¬ 
rameters  of  a  single  element,  but  between 
parameters  of  different  elements.  For  ex¬ 
ample,  the  optimized  value  of  the  orbital  ex¬ 
ponent  for  hydrogen  is  directly  affected  by 
the  value  of  the  orbital  exponent  for  carbon. 

A  new  and  completely  general  optimiza¬ 
tion  procedure  for  parameterizing  semiem- 
pirical  methods  of  the  type  used  in  MOPAC® 
is  described  here.  In  this  method,  use  is 
made  of  the  fact  that  over  a  sufficiently 
small  domain  the  calculated  values  of  prop¬ 
erties  depend  linearly  on  the  values  of  the 
parameters  used.  This  allows  calculated 
quantities  to  be  approximated  by  a  simple 
sum  of  constants  times  parameters.  At  inter¬ 
vals,  explicit  semiempirical  calculations 
which  may  involve  geometry  optimization 
are  needed  to  monitor  the  validity  of  this 
approximation,  and  to  correct  accumulated 
errors.  Given  a  set  of  experimental  data,  the 
method  described  here  can  be  used  to  obtain 
a  fully  optimized  set  of  parameters  relatively 
rapidly. 


THEORY 

It  is  important  to  know  how  each  parameter 
is  used  within  the  set  of  approximations  used 
by  MNDO  and  AM1.^°  Thus  a  brief  outline 
of  each  parameter  will  now  be  given.  A  more 
complete  description  of  these  methods  can  be 
found  in  Refs.  1  and  10. 

For  MNDO  the  calculated  heat  of  forma¬ 
tion  is  given  by 

=  F^Eiect  +  -Enuc  “ 

A  A 

(1) 

where  Ai/J?  is  the  experimental  heat  of  for¬ 
mation  for  atom  A.  E^i  is  the  calculated  en- 
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ergy  of  formation  of  the  gaseous  atom  A 
from  its  ion, 

Eei  =  fiPi, ,  U,, ,  {ii \jj ),{ij\ij)),  (2) 

where  P„  is  the  ground  state  atomic  orbital 
population  of  atomic  orbital  i.  The  Ua  are 
the  one-electron  energies  of  the  atomic  or¬ 
bitals  of  the  ion  resulting  from  removal  of 
all  valence  electrons.  These  are  represented 
by  the  parameters  and  Upp.  are 

the  two-electron  one-center  integrals.  In 
general,  there  are  five  of  these  per  atom; 
these  are  represented  by  the  parameters: 
{ss|ss)  =  {ss\pp)  =  G,p  (ppIpp)  =  Gpp, 
ipplP'p'}  =  Gp2,  and  {sp\sp)  =  H,p.  Here  p 
and p'  are  p-type  atomic  orbitals  on  a  single 
center  but  differing  in  their  angular  func¬ 
tional  form. 

£nuc  is  the  nuclear-nuclear  repulsion 
energj'  and  is  given  by 

■S'Nuc  =  (3) 

‘<7 

with 

pN^hj)  ~  Zj^Zb{AA\BB) 

•  (1  + 

+  (ZiZjfRij) 

^  k 

+  (4) 

k  ' 

in  which  atom  i  is  of  type  A,  atom  j  is  of  type 
B,  {AA\BB)  is  a  two-center  integral  of  type 
(ss|ss),  Zj  is  the  number  of  valence  electrons 
on  atom  i,  and  i?y  is  the  interato^  it  dis¬ 
tance.  Only  the  first  term  in  this  exp  '"sion 
is  present  in  MNDO.  The  second  term  was 
introduced  in  the  AMI  method  in  order  to 
correct  for  excessive  long-range  repulsions 
in  the  core-core  repulsion.  The  quantities 
.  Oi* .  ^ik )  and  Cj*  are  optimizable  parame¬ 
ters. 

There  are  minor  variations  on  this  expres¬ 
sion  when  O  —  H  or  N  —  H  interactions  are 
involved;  these  are  detailed  in  Ref.  1. 

The  total  electronic  energy  is  given  by 

•®Elect  ~  (1/2)P(H  +  F) ,  (5) 

with  P,  H,  and  F  being  the  density,  one-elec¬ 
tron,  and  Fock  matrices,  respectively.  The 
Fock  matrix,  over  atomic  orbitals,  is  com¬ 
posed  of  one-  and  two-electron  terms.  Diago¬ 
nal  Fock  matrix  elements  are  defined  by 


Fmm  =  Umm  +  E  +  SP,, 

B  •' 

■  [{pp\vv)  -  {pv\pu)] 

B  B 

^  S  S  S  PA.(r(pp|A,cr) ,  (6) 

B  K  cr 

where  V^^,b  is  the  potential  experienced  by 
atomic  orbital  due  to  all  nuclei  in  the  sys¬ 
tem  other  than  that  of  (t>p.  Matrix  elements 
connecting  orbitals  on  different  atoms  are 
defined  by 

K(T 

(7) 

where  and  are  atomic  parameters  of 
type  )8s  or  /3p.  The  overlap  is  the  overlap 
of  two  Slater  atomic  orbitals  of  type 

<l>p  =  e-^^Yrie,(}>),  (8) 

in  which  ^  is  an  atomic  orbital  exponent  of 
type  oi'  ^p-  Integrals  of  the  type 
are  complicated  functions  of  the  Slater 
orbital  exponents  and  the  two-electron  one- 
center  integrals.  The  simplest  of  these  two- 
center  terms  is  the  <ss|s's')  integral,  the  re¬ 
pulsion  between  two  electrons  each  in  an 
s  orbital.  This  integral  has  the  form 


27.21 

(ss|s's')  = 

(9) 

All  other  two-center  two-electron  inte¬ 
grals  involve  the  Slater  orbital  exponents.  A 
full  description  of  these  functions  is  given  in 
Ref.  11. 


In  the  preceding  discussion  12  variables 
were  identified  per  element  within  MNDO 
which  could  be  regarded  as  parameters.  In 
earlier  work,  only  seven  of  these  were  opti¬ 
mized:  these  are  given  the  symbols  Ugs ,  Upp , 
Bs,^p,Ly^p,  and  a.  The  remaining  five,  G,*, 
Gsp,  Gpp,  Gp2,  and  H,p,  while  adjustable 
parameters  in  principle,  were  assigned  val¬ 
ues  determined  from  atomic  spectra. The 
AMI  method  introduced  a  second  set  of  pa¬ 
rameters  in  the  form  a  gaussian  core-core 
interaction  to  correct  for  excessive  long- 
range  repulsion  in  the  original  MNDO  core¬ 
core  repulsion  term.  By  assigning  up  to  four 
such  gaussians  to  each  atom,  as  many  as 
12  more  parameters  are  introduced.  Each 
element  in  AMI  is  described  by  10  to  19 
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MND0-PM3 

The  theoretical  framework  for  AMI  is 
similar  to  that  of  MNDO,  the  only  difference 
being  the  presence  in  AMI  of  radial  gaus- 
sians  which  modify  the  core-core  repulsion 
term.  As  there  have  been  two  earlier  parame- 
terizations  of  the  MNDO-type  model,  in  order 
to  facilitate  discussion,  the  resulting  method, 
namely,  the  combined  theoretical  framework 
and  optimized  parameter  set,  is  best  described 
as  a  modified  neglect  of  diatomic  overlap, 
parametric  method  3,  or  MNDO-PM3. 

In  this  work,  the  parameter  set  will  con¬ 
sist  of  18  parameters  for  each  element,  with 
the  exception  of  hydrogen,  which  will  have 
11  parameters.  For  convenience,  the  pa¬ 
rameters,  P  =  Pi, . . .  ,Pi, . . .  ,Pn  (i  - 
are  always  represented  in  the  following 
sequence, 

Gpp,Gp2,Hsp,ai,bi,Ci,a2,b2,C2.  For  hydrogen 
the  seven  parameters  involving  p  orbitals 
are  absent. 

REFERENCE  FUNCTIONS 

As  the  objective  of  optimizing  parameters 
is  to  accurately  reproduce  experimentally 
observed  values  of  chemical  properties,  these 
quantities  must  be  used  in  the  optimization 
procedure.  This  is  accomplished  by  using 
reference  functions  which  represent  experi¬ 
mentally  observed  phenomena.  Reference 
functions  are  dimensionless  quantities  which 
can  be  identified  with  an  experimentally  ob¬ 
servable  phenomenon.  An  example  would  be 
the  reference  function  which  represents  the 
heat  of  formation  of  water.  The  experimental 
reference  function  is  obtained  by  multiplying 
the  observed  heat  of  formation  by  a  weight¬ 
ing  factor  in  (kcal/mol"^).  The  resulting  quan¬ 
tity  is,  as  mentioned  above,  dimensionless. 

Molecular  properties  are  used  to  define 
reference  functions.  The  parameters  are  ad¬ 
justed  to  minimize  the  square  of  the  differ¬ 
ence  between  the  calculated  and  experimental 
reference  function.  Within  this  parameteri¬ 
zation  method  three  tjqies  of  reference  func¬ 
tions  are  used. 

Experimental  Reference  Functions 

Most  reference  functions  are  derived  from 
experimental  data.  In  a  few  instances,  how¬ 
ever,  high-level  ab  initio  results  are  used 


where  experimental  data,  particularly  for 
geometries,  is  either  absent  or  of  low  qua¬ 
lity.  As  with  previous  optimizations,  the  sets 
of  properties  to  be- reproduced  consist  of  heats 
of  formation,  dipole  moments,  ionization 
potentials,  and  molecular  geometries.  In  ad¬ 
dition,  heats  of  reaction  and  heats  of  forma¬ 
tion  for  parts  of  molecules  were  used. 

On  occasion,  a  heat  of  reaction  is  known 
with  higher  accuracy  than  the  heats  of  for¬ 
mation  of  the  reactants  or  products.  In  these 
instances,  use  of  heats  of  formation  is  unsat¬ 
isfactory,  and  the  heat  of  reaction  may  be 
used  instead. 

There  are  a  number  of  n-alkanes  for  which 
the  heats  of  formation  are  known.  The  dif¬ 
ference  in  heats  of  formation  of  the  higher 
homologs  rapidly  converges  to  a  constant 
value:  -4.7  kcal/mol  per  methylene  unit. 
Rather  than  use  all  available  data  on  the 
higher  alkanes,  a  reference  function  can  be 
constructed  from  the  difference  in  heats  of 
formation  of  two  adjacent  n-alkanes.  By 
assigning  an  increased  weight  to  this  refer¬ 
ence  function  the  higher  alkanes  can  be  rep¬ 
resented.  The  increased  weight  will  not 
necessarily  result  in  a  reduced  error  for 
the  n-alkanes  used  as  reference  functions. 
However,  the  systematic  error,  that  is,  the 
difference  in  error  between  the  various  higher 
alkanes,  will  be  reduced. 

All  reference  functions  used  were  for  prop¬ 
erties  of  gas-phase  systems,  with  the  excep¬ 
tion  of  geometric  parameters  derived  from 
x-ray  measurements  on  solids.  In  general, 
compilations  of  reference  data  were  used; 
these  appeared  to  have  higher  internal  con¬ 
sistency.  Heats  of  formation  used  experi¬ 
mentally  were  relative  to  the  elements  in 
their  standard  state  except  for  phosphorus, 
for  which  white  phosphorus  was  used  in¬ 
stead  of  red.  Where  possible,  experimental 
ionization  potentials  (IP)  were  those  ob¬ 
tained  by  photoionization  rather  than  by 
electron  impact.  Where  possible,  geometries 
derived  from  microwave  studies  were  used 
in  preference  to  x-ray. 

As  mentioned  earlier,  reference  functions 
must  be  made  dimensionless  in  order  to  be 
compared.  This  is  accomplished  by  multiply¬ 
ing  the  vpl’ie  of  each  reference  function  by 
a  constant  or  weighting  factor,  Wi,  so  that 
the  reference  data  used  in  the  optimization 
is  dimensionless.  For  heats  of  formation. 
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dipoles,  and  IPs,  the  default  weights  were 
1.0  kcal/mol'^  20  D“\  and  10  eV~\  respec¬ 
tively.  It  is  inconvenient  to  use  experimental 
geometries  as  reference  functions.  Instead, 
the  derivative  of  the  heat  of  formation  with 
respect  to  geometry  for  the  reference  ge¬ 
ometry  was  used.  This  derivative  will  be  ap¬ 
proximately  proportional  to  the  amount  by 
which  the  predicted  and  observed  geome¬ 
tries  differ;  the  proportionality  constant  be¬ 
ing  the  force  constant  for  that  deformation. 
Only  when  the  predicted  and  observed  ge¬ 
ometries  are  identical  will  the  derivative  be¬ 
come  zero.  The  default  value  of  Wi  chosen  for 
geometries  was  0.7  kcal/mol”^  per  A.  For 
certain  functions  a  higher  or  lower  weight¬ 
ing  factor  was  used,  depending  on  their 
chemical  importance.  Thus  Wi  for  the  heat  of 
formation  of  benzene  was  increased  to  re¬ 
flect  its  importance  as  the  prototypical  aro¬ 
matic  compound.  For  similar  reasons  Wi  for 
the  heat  of  formation  of  a  methylene  group 
in  an  alkane  was  also  increased. 

Calculated  Reference  Functions 

The  values  of  molecular  properties  calcu¬ 
lated  using  the  current  set  of  parameters  are 
also  used  for  constructing  reference  func¬ 
tions.  Ideally,  these  calculated  reference 
functions  would  be  equal  in  value  to  the  ex¬ 
perimental  reference  functions. 

With  the  exception  of  geometric  reference 
functions,  all  calculated  reference  functions 
are  evaluated  using  fully  optimized  geome¬ 
tries.  During  the  initial  stages  of  the  param¬ 
eter  optimization,  the  geometries  changed 
relatively  rapidly,  although  the  total  change 
was,  with  few  exceptions,  quite  small.  The 
starting  geometries  generated  using  MNDO 
parameters  were  quite  close  to  both  the 
experimental  geometries  and  to  the  ge¬ 
ometries  calculated  using  the  final  opti¬ 
mized  parameters. 

Heat  of  formation  and  dipole  reference 
functions,  Qi,  were  derived  from  the  self-con¬ 
sistent  field  (SCF)  heats  of  formation  and 
dipoles  as 

qi  =  WiDf‘.  (10) 

Ionization  potential  reference  functions 
were  simply  derived  from  the  molecular  or¬ 
bital  energy  levels  via 


q,  =  -w,E{-n), ,  (11) 

where  n  applies  to  the  ionization  potential, 
and  e(-n)  is  the  MO  energy  level  n  -  1  below 
the  highest  occupied  energy  level.  In  the 
case  of  doublet  states,  a  correction  was  made 
for  the  half-electron  approximation. 

Geometric  reference  fimctions  were  derived 
from  the  calculated  derivatives  of  the  AH/- 
with  respect  to  the  appropriate  geometric  co¬ 
ordinate  using  experimental  geometries, 
that  is; 


where  Xj  is  an  experimentally  determined 
coordinate,  either  a  bond  length,  an  angle, 
or  a  dihedral  angle. 

Simplified  Reference  Functions 

The  form  in  which  the  parameters  are 
normally  used  (as  in  the  equations  given 
above)  has  serious  limitations.  The  effect  of 
modifying  parameters  without  performing  a 
full  MNDO  calculation  involving  SCF  calcu¬ 
lations  and,  in  certain  cases,  even  geometry 
optimizations,  is  impossible  to  determine.  It  is 
desirable,  therefore,  to  represent  the  MNDO 
equations  in  a  simpler  form,  and,  where 
possible,  to  use  this  simpler  form  for  all 
operations  involving  modification  of  the  pa¬ 
rameters.  For  our  purposes,  then,  the  depen¬ 
dency  of  reference  functions  which  are 
derived  from,  for  example,  heats  of  forma¬ 
tion,  will  be  expressed  in  terms  of  the  pa¬ 
rameters  used  in  the  description  of  the  method 
as  a  simple  summation,  that  is 

gi(AP)  =  c^  +  2Ea^;*AP/.  (13) 

j  * 

where  is  the  calculated  heat  of  formation, 
and  APy  is  the  difference  in  parameter  j  be¬ 
tween  the  point  at  which  q,  was  evaluated, 
Pj,  using  MNDO  and  the  current  value  of 
Pj,  that  is,  AP;  =  (P°  -  P^). 

If  only  small  changes  in  the  values  of 
parameters  are  to  be  considered,  then  this 
power  series  can  be  truncated  so  that  all 
powers  greater  than  one  are  ignored: 

qi(AP)  =  Ci  +  Ila^  AP;  .  (14) 

J 
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DERIVATIVE  OF  REFERENCE 
FUNCTION  WITH  RESPECT  TO 
PARAMETERS 

Some  derivatives  are  zero:  for  a  molecule 
with  a  given  geometry  there  is  no  dependency 
of  the  ionization  potential  on  the  core¬ 
core  repulsion.  Theoretically,  these  deriva¬ 
tives  are  finite.  In  the  example  given  the 
core-core  repulsion  affects  the  geometry 
which,  in  turn,  affects  the  IP.  However,  any 
significant  change  in  the  core-core  potential 
is  compensated  for  by  equivalent  and  oppo¬ 
site  changes  induced  by  the  other  parame¬ 
ters.  This  state  of  balance  is  a  consequence 
of  the  constraints  imposed  on  the  system  by 
the  geometry  reference  functions;  the  opti¬ 
mized  geometries  are  typically  within  2%  of 
the  observed  values.  As  a  result,  such  deriva¬ 
tives,  although  finite,  can  safely  be  ignored. 
Some  derivatives  can  be  obtained  analyti¬ 
cally.  For  example,  the  derivative  of  an  ion¬ 
ization  potential  reference  function  with 
respect  to  the  's’  one-center  one-electron  pa¬ 
rameter  for  element  k  can  be  easily  obtained 
from  the  expression  for  the  MO  energy  level. 
For  closed-shell  systems,  Koopmans’  theo¬ 
rem  states  that  the  ionization  potential,  , 
is  the  negative  of  the  associated  molecular 
orbital  energy 

-/,  =  £;  =  +  (15) 

in  which  the  molecular  orbitals  are  normal¬ 
ized  LCAOs, 

(16) 

A 

For  IPs  the  reference  function  is  equivalent 
to  -lOgj,  therefore 

8qi/8U,Ak)  =  -IO^CkMKs,),  (17) 

X 

where  0(X,s*)  =  1  if  X  represents  an  's’ 
atomic  orbital  of  type  k,  and  zero  otherwise. 

Only  a  few  derivatives  can  be  calculated 
using  analytical  techniques;  most  deriva¬ 
tives  can  only  be  obtained  using  finite- 
difference  techniques.  Table  I  indicates 
how  each  type  of  derivative  is  obtained. 

RELATIONSHIPS  BETWEEN 
REFERENCE  FUNCTIONS 

By  using  dimensionless  reference  fimctions 
and  expressing  their  calculated  values  as  a 


simple  sum,  one  can  determine  the  similar¬ 
ity  or  difference  of  any  two  reference  func¬ 
tions.  For  both  the  optimization  of  parameters 
and  establishing  the  generality  of  the  method, 
as  wide  a  range  as  possible  of  reference  func¬ 
tions  must  be  used. 

Each  simplified  reference  function  gives 
rise  to  a  vector  which  defines  a  point  in  pa¬ 
rameter  space.  Although,  in  principle, 
exactly  n  reference  functions  would  be  suffi¬ 
cient  to  define  the  values  of  a  set  of  n  pa¬ 
rameters,  in  practice  many  more  reference 
functions  than  parameters  are  needed.  Inac¬ 
curacies  in  the  values  of  experimental  data 
may  bias  the  parameters.  By  using  a  large 
number  of  reference  functions,  errors  in  any 
given  reference  function  are  minimized. 
More  importantly,  reference  functions  are 
not  independent.  That  is,  two  reference  func¬ 
tions  may  have  a  similar  dependency  on  the 
parameters.  This  phenomenon  can  be  quan¬ 
tified  using  the  parameter  dependency  func¬ 
tions  derived  above. 

For  any  reference  function  the  parameter 
dependency  can  be  expressed  as  a  vector  of 
length  equal  to  the  number  of  parameters. 
To  determine  the  similarity  of  two  reference 
functions  their  associated  parameter  depen¬ 
dency  vectors  must  be  normalized.  This  is 
done  in  two  steps.  First,  since  the  number  of 
times  a  particular  element  occurs  in  a  com¬ 
pound  varies,  the  derivatives  for  each  ele¬ 
ment  are  individually  normalized,  thus: 

qiiiiP)  =  Ci  +  ^aijUPj,  (18) 


Table  I.  Method  used  for  calculation  of  first  deriva¬ 
tives. 


Parameter 

Heat  of  formation  Dipole 

IP  Geometry 

A 

F 

F 

0 

Upp 

A 

F 

F 

0 

/8. 

A 

F 

F 

F 

Pp 

A 

F 

F 

F 

i. 

A 

F 

F 

F 

A 

F 

F 

F 

a 

A 

0 

0 

0 

G„ 

F 

F 

F 

0 

G.P 

F 

F 

F 

0 

Gpp 

F 

F 

F 

0 

Gp2 

F 

F 

F 

0 

F 

F 

F 

0 

Oi* 

A 

0 

0 

F 

ho, 

A 

0 

0 

F 

Ci* 

A 

0 

0 

F 

Abbreviations:  A  =  analytical  derivative;  F  =  finite- 
difference  derivative;  O  =  derivative  is  zero. 
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^  =  2  Oy  ^  2  2  Oy  .  (19) 

j  k  lek 

where  k  is  the  index  of  an  element  present 
in  the  reference  function  g,. 

Let  Cij '  =  riiiflij  for  each  element,  where  n,* 
is  a  normalization  constant  for  element  k  in 
reference  function  i.  Then 

=  (20) 

j 

is  a  vector  representing  the  reference  func¬ 
tion  in  which  the  coefficients  of  each  ele¬ 
ment  are  normalized. 

<^'  is  a  vector  of  length  equal  to  the  num¬ 
ber,  Tit,  of  types  of  elements  in  the  com¬ 
pound.  This  vector  may  be  normalized  thus: 

<f>t  =  lctj  =  nt-^’^T}>\.  (21) 

J 

Using  these  vectors  the  similarity  or  over¬ 
lap  of  any  pair  of  reference  functions  can  be 
determined.  The  overlap  of  two  reference  func¬ 
tions,  is  given  by 

=  Icucji.  (22) 

i 

Note  that  by  representing  the  reference 
functions  by  the  vectors  <f>i>  all  chemical 
attributes  have  been  removed.  This  allows 
a  simple  comparison  to  be  made  between 
superficially  very  different  forms  of  data,  for 
example  the  ionization  potential  of  methanol 
and  the  C  —  O  —  C  angle  in  dimethyl  ether. 

Two  reference  functions  which  have  no 
elements  in  common  will  have  an  overlap  of 
zero.  Conversely,  the  overlap  of  two  similar 
compounds,  for  example  the  for  hexane 
and  heptane,  would  have  an  overlap  approach¬ 
ing  unity.  Two  reference  functions  may  be 
described  as  similar  if  their  overlap  is  large, 
greater  than  about  0.999,  and  different  if 
their  overlap  is  small,  less  than  about  0.8. 

To  adequately  sample  parametric  space 
with  a  minimum  number  of  reference  func¬ 
tions,  reference  functions  which  are  similar 
should  not  be  used. 

For  any  given  set  of  reference  functions, 
the  degree  of  independence  of  parameters 
can  be  determined  by  constructing  and  diag¬ 
onalizing  the  secular  matrix  of  such  over¬ 
laps.  Using  m  reference  functions  and  n 
parameters,  a  secular  matrix  of  size  m  by  m 
can  be  constructed.  Since  there  are  only  n 
possible  independent  variables,  Uiagonaliza- 
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tion  of  this  matrix  will  yield  at  least  (m-n) 
exactly  zero  eigenvalues  and  at  most  n  posi¬ 
tive  eigenvalues.  These  latter  n  eigenvalues 
provide  an  indication  of  the  independency  of 
the  parameters.  Consider  a  single-element, 
n -dimensional,  parameter  space  and  n  refer¬ 
ence  functions.  If  these  parameters  were 
fully  independent,  that  is,  the  overlap  of  any 
two  reference  functions  being  zero,  then  all 
eigenvalues  would  be  identically  unity.  If 
the  parameters  are  interdependent,  then 
some  roots  would  be  larger  than  one,  while 
others  become  less.  If  any  two  parameters 
have  exactly  the  same  behavior,  then  one  of 
the  n  roots  would  become  exactly  zero.  For 
convenience  in  expressing  the  interdepen¬ 
dency  of  parameters,  the  ratio  of  the  total 
number  of  reference  functions  to  the  smallest 
eigenvalue  will  be  used. 

The  principal  desideratum  of  a  reference 
function  is  that  it  decrease  this  ratio.  Any 
reference  function  which  does  not  contrib¬ 
ute  significantly  to  the  lowest  eigenvector 
will  not  affect  its  value  significantly.  Conse¬ 
quently,  removal  of  such  a  function  will  de¬ 
crease  the  number  of  reference  functions  by 
unity,  with  a  concommitant  decrease  in  the 
interdependency  of  the  parameters. 

THE  ERROR  FUNCTION 

With  one  qualification,  a  sufficient  and 
necessary  condition  for  an  optimized  set  of 
parameters  is  that  the  error  function,  S,  de¬ 
fined  as  the  sum  of  the  squares  of  the  differ¬ 
ences  between  calculated  and  experimental 
values  for  the  reference  functions,  should  be 
a  minimum.  That  is,  given 

S  =  S  (9“*'  -  (23) 

i 

where  q'®*'  and  qf*’’  are  the  calculated  and 
experimental  values  for  reference  functions, 
then  the  parameter  set  is  optimized  when  S 
is  a  minimum. 

If  only  a  small  number  of  reference  func¬ 
tions  are  used,  then  multiple  minima  are 
possible.  The  presence  of  false  minima  can 
be  detected  by  surveying  a  large  number  of 
reference  functions.  If  the  parameters  are 
trapped  in  a  local  false  minimum,  then  there 
will  be  a  set  of  reference  functions  which 
will  be  badly  predicted.  By  including  some 
reference  functions  from  this  set,  these  false 
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minima  may  be  eliminated.  There  is,  unfor¬ 
tunately,  no  simple  test  which  indicates  when 
the  global  rather  than  local  minimum  has 
been  reached  other  than  searching  for  refer¬ 
ence  functions  which  are  badly  predicted. 

With  this  caveat  regarding  local  minima, 
the  task  of  optimizing  parameters  is  defined. 
It  should  be  emphasized  that  no  criteria  for 
the  values  of  the  parameters  other  than  mini¬ 
mizing  S  are  necessary.  The  specification  of 
a  reference  function  is  sufficiently  flexible  to 
allow  extra  weight  to  be  given  to  a  particu¬ 
lar  property  of  a  particular  compound  or  re¬ 
action.  As  the  stated  objective  is  to  predict 
the  values  of  chemical  properties,  the  values 
of  the  parameters  as  such  are  of  little  impor¬ 
tance.  In  particular,  any  relationship  between 
parameters  and  quantities  of  theoretical 
significance  would  be  fortuitous.  This  is  ex¬ 
emplified  by  the  fact  that  several  low-lying 
eigenvalues  result  from  diagonalizing  the 
matrix  of  second  derivatives  of  the  reference 
functions.  Small  eigenvalues  indicate  that 
large  changes  may  be  made  to  the  values  of 
the  parameters  with  only  a  small  change  re¬ 
sulting  in  the  value  of  S. 

Although  the  parameters  cannot  be  easily 
related  to  quantum-mechanical  quantities, 
in  a  limited  number  of  cases  some  physical 
meaning  can  be  associated  with  them.  The 
gaussians,  which  modify  the  core-core  inter¬ 
action,  are  an  example.  Their  purpose  is  to 
modify  the  interatomic  interaction  over  the 
distances  within  which  chemical  bonds  are 
important,  and  secondarily  to  modify  "long- 
range”  interactions  up  to  approximately  3  A. 
A  large,  narrow  gaussian  centered  in  this  lat¬ 
ter  region  would  not  be  physically  meaningful. 

Since  the  reference  functions  are  experi¬ 
mental,  that  is,  they  take  into  account  all 
quantum  mechanical  operators,  and  the 
quantum  mechanical  framework  of  MNDO 
is  limited,  each  parameter  which  represents 
a  quantum  mechanical  operator  will  also 
represent  all  its  operator  equivalents.  Thus 
the  carbon  's’  orbital  exponent,  while  in 
principle  representing  the  2s  orbital  on  car¬ 
bon,  will  also  represent  all  higher  's’  orbitals. 
Similarly,  the  gaussian  core-core  interac¬ 
tions  which,  in  AMI,  were  intended  to  correct 
for  an  excessive  long-range  repulsion  are  also 
operator  equivalents  to  the  Van  der  Waals 
attraction  or  dispersion  effects  and  can  there¬ 
fore  behave  as  if  they  were  dispersion  forces 


operators.  In  other  words,  the  correlation 
effects,  which  in  ab  initio  methods  require 
extensive  calculations  of  the  Mpller-Plesset 
type,  can,  at  the  MNDO  level  of  approxima¬ 
tion,  be  represented  by  simple  gaussian  func¬ 
tions.  Conversely,  any  phenomenon  which 
cannot  be  represented  by  any  of  the  operators 
in  MNDO  cannot  be  reproduced.  Examples 
of  such  phenomena  are  given  in  the  Discus¬ 
sion  section. 


OPTIMIZATION  OF  PARAMETERS 

For  quadratic  functions,  minimization  is 
obtained  in  one  step  by  the  operation 

P(opt)  =  P  -  g/H  (24) 

where  H  is  the  Hessian  matrix  of  second 
derivatives,  and  g  is  the  derivative  vector 
8S/8P. 

Although  both  g  and  H  can  be  readily  cal¬ 
culated,  the  ratio  of  highest  to  lowest  eigen¬ 
values  of  H  is  very  large  —  when  a  large 
number  of  parameters  are  involved,  this  ra¬ 
tio  can  exceed  10^^ — and  the  inverse  of  H  is 
numerically  very  sensitive  to  small  changes 
in  parameters.  In  part,  this  is  a  consequence 
of  the  neglect  of  higher-order  terms  in  the 
simplified  reference  function,  which  preclude 
the  possibility  of  negative  eigenvalues  aris¬ 
ing  from  the  Hessian  of  S.  'This,  in  turn,  leads 
to  completely  unrealistic  values  for  g/H, 
and  the  'optimized’  parameters  are  invari¬ 
ably  worse  than  the  initial  set. 

Several  straightforward  techniques  exist 
for  function  optimization;  the  DFP^^  the 
Murtagh-Sargent^®  and  the  BFGS.^®  Each 
method  determines  a  search  direction  based 
on  the  local  gradient  and  an  approximation 
to  the  inverse  Hessian.  In  all  these  methods 
the  accuracy  of  the  inverse  Hessian  improves 
as  the  optimization  proceeds. 

An  initial  attempt  was  made  to  optimize 
parameters  using  the  DFP  update  formula. 
The  time  per  function  evaluation — in  this 
case  a  full  geometry  optimization  followed 
by  derivative  calculation  —  was  prohibitive. 
The  time  required  for  the  geometry  opti¬ 
mization  step  alone  was  approximately  24 
hours  for  the  final  reference  function  basis 
set  used,  starting  from  standard  bond  lengths 
and  angles.  Direct  function  evaluation  was 
clearly  impractical.  Therefore  an  alternative 
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method  for  determining  how  to  modify  pa¬ 
rameters  had  to  be  developed. 

For  small  changes  in  parameters,  the  value 
of  the  reference  functions,  and  consequently 
S,  can  be  calculated  using  Eq.  (14).  This  al¬ 
lows  for  rapid  evaluation  of  S  at  points  other 
than  that  of  the  initial  function  evaluation. 

In  addition,  since  the  local  gradient  function 
can  be  calculated  from  Eq.  (14),  not  only  can 
a  line  search  be  carried  out  but  new  search 
directions  can  be  calculated.  In  its  current 
form,  however,  the  optimization  cannot  be 
allowed  to  proceed  to  completion  for  two  rea¬ 
sons:  la)  the  function  S  has  quadratic  depen¬ 
dency  on  P  over  only  a  small  domain  around 
the  point  of  the  full  function  evaluation,  and 
lb)  in  the  limit,  the  numerical  instability  of 
the  inverse  Hessian  would  result  in  unrealis¬ 
tic  changes  in  the  parameters.  The  addition 
to  S  of  a  quadratic  term  of  form  c(Ap^)  ■ 

I  -  P*  ')“,  where  AP  is  the  change  in  P 
from  the  initial  point,  and  P*  -  P*  '  is  the 
change  in  the  values  of  the  parameters  be¬ 
tween  the  start  of  the  previous  cycle  of  opti¬ 
mization  and  the  start  of  the  current  cycle, 
moderated  the  excursion  of  the  P  vector.  The 
multiplier  c  was  initially  set  to  an  arbitrary 
positive  value  and  subsequently  modified  in 
order  to  maximize  the  rate  of  optimization. 
Various  empirical  rules  were  evolved  to  de¬ 
termine  the  value  of  c  on  any  given  cycle. 
The  final  set  used  are  as  follows: 

a.  If  the  final  predicted  decrease  in  S  and 
the  amount  obtained  by  a  full  function  evalu¬ 
ation  were  within  25%,  the  value  of  c  was 
halved. 

b.  If  the  angle  between  the  AP  vectors  on 
two  successive  cycles  was  within  5°,  c  was 
halved. 

c.  If  the  final  decrease  in  S  calculated 
from  a  full  function  evaluation  was  less  than 
50%  of  that  predicted,  the  value  of  c  was 
doubled. 

d.  If  the  angle  between  the  AP  vectors  on 
two  successive  cycle.*^  was  greater  than  30°,  c 
was  doubled. 

After  the  parameters  are  optimized,  sub¬ 
ject  to  this  perturbation,  a  new  explicit  func¬ 
tion  evaluation  is  done.  Because  there  is  de 
facto  no  error  in  either  S  or  g  at  this  new 
point,  the  origin  of  the  quadratic  restraining 
term  used  in  the  previous  cycle  is  moved  to 
the  current  point.  The  rapid  optimization  is 
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then  resumed.  Finally,  during  the  optimiza¬ 
tion  some  parameters  tended  to  quite  unre¬ 
alistic  values.  This  was  a  natural  consequence 
of  the  low  eigenvalues  for  the  parameter 
Hessian  matrix.  Although,  as  we  have  seen, 
no  physical  meaning  can  be  attached  to  any 
given  parameter,  it  was  nevertheless  consid¬ 
ered  appropriate  to  apply  boundary  condi¬ 
tions  to  all  parameters.  It  is  likely  that 
parameter  sets  which  contain  unrealistic 
values  will  eventually  be  found  to  be  unsuit¬ 
able  for  use  in  the  study  of  certain  phenom¬ 
ena,  where  a  more  realistic  parameter  set 
would  be  found  to  be  suitable.  For  each  pa¬ 
rameter  an  upper  and  lower  bound  was  as¬ 
signed.  If  the  value  of  a  parameter  exceeded 
its  allowed  bound,  the  function  value  within 
the  DFP  was  increased  by  100  times  the 
square  of  the  amount  by  which  the  defined 
bound  was  exceeded.  This  sequence  was 
continued  until  the  parameters  converged 
within  a  preset  limit.  Using  this  procedure 
the  sequence  for  optimizing  parameters 
becomes: 

1.  Full  calculation  of  S  and  g. 

2.  Line  search  optimization  of  S  using  Eq.  ( 14) 
and  a  quadratic  restraining  term. 

3.  New  g  evaluated  by  direct  differentiation 
of  Eq.  (14)  and  the  quadratic  term  at  the 
minimum  located  using  step  2. 

4.  The  approximate  inverse  Hessian  updated 
according  to  the  DFP  formula. 

5.  Steps  2,  3,  and  4  repeated  until  the 
parameters  are  converged  within  a  preset 
limit.  The  sequence  2-3-4  constitutes 
one  cycle  of  optimization.  Since  the  first 
derivative  of  the  energ>'  with  respect  to 
optimized  geometry  is  zero,  and  the  IPs 
and  dipoles  are  relatively  insensitive  to 
small  geometry  changes,  molecular 
geometries  were  only  optimized  every 
few  cycles. 

6.  Unless  the  change  in  parameters  is  ac¬ 
ceptably  small,  the  sequence  is  repeated 
starting  from  step  1. 

During  the  initial  stages  of  an  optimiza¬ 
tion,  the  value  of  S  dropped  relatively  rap¬ 
idly.  This  corresponded  to  motion  in  P  space 
along  the  higher  eigenvectors.  Once  these 
gross  changes  were  made,  the  rate  of  opti¬ 
mization  dropped.  The  average  decrease  in 
the  value  of  S  on  any  function  evaluation 
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within  the  DF  F  procedure  was  approxi¬ 
mately  0.00 1*", .  About  1500  function  evalua¬ 
tions  wurc  performed  during  step  5  in  the 
above  sequence.  Since  each  function  evalu¬ 
ation  involved  only  a  double  sum,  the  time 
required  for  each  of  these  rapid  function 
evaluations  was  short  With  400  reference 
functions  and  198  parameters  being  opti¬ 
mized  simultaneously,  each  line  search 
(which  involved  typically  5  function  evalua¬ 
tions  and  calculation  of  derivatives!  required 
5  s  on  a  Gould  MVIOOOO. 

SPECIFICATION  OF  THE  MOLECULAR 
BASIS  SET 

For  each  element  an  initial  trial  set  of 
molecule.s  and  ions  was  chosen  to  represent 
its  chemistry.  After  the  parameters  were  op¬ 
timized.  the  full  range  of  compounds  were 
surveyed.  Compounds  which  had  large  dif¬ 
ferences  between  calculated  and  experimen¬ 
tal  values  for  properties  were  included  in 
the  basis  set  and  the  parameters  reoptimized. 

Initially,  no  organic  polynitro  compounds 
were  involved.  After  optimizing  the  parame¬ 
ters  using  this  deficient  basis  set,  a  survey 
was  carried  out.  This  revealed  that  heats  of 
formation  for  organic  polynitro  compounds 
were  very  badly  predicted.  Some  representa¬ 
tive  polynitro  compounds  were  added  to  the 
basis  set,  and  the  parameters  reoptimized. 
At  the  end  of  the  optimization,  it  was  found 
that  the  S  for  the  original  basis  set  had  not 
risen,  but  the  errors  for  polynitro  organics 
had  dropped  considerably.  The  fact  that  the 
original  parameter  set  was  optimized  and 
yet  a  markedly  different  parameter  set  could 
give  almost  identical  results  for  the  original 
basis  set  was  attributed  to  the  presence  of 
almost  zero  eigenvalues  in  the  parameter 
Hessian:  motion  in  any  of  the  directions  of 
the  almost  zero  eigenvectors  would  not  af¬ 
fect  the  S  significantly.  Clearly  the  original 
large  errors  in  the  polynitro  compounds 
were  due  to  the  parameter  set  being  insensi¬ 
tive  to  perturbations  in  certain  directions 
using  the  original  basis  set.  On  adding 
polynitro  compounds  to  the  basis  set  this  in¬ 
sensitivity  was  removed,  and  a  new,  more 
general,  minimum  was  reached. 


PARAMETER  SETS 

During  the  optimization,  constraints  were 
imposed  on  the  allowed  values  of  parame¬ 
ters.  Even  when  a  large  number  of  molecules 
are  used  for  defining  the  values  of  parame¬ 
ters,  the  potential  still  exists  for  an  un¬ 
realistic  parameter  set  to  be  generated.  In 
particular,  if  the  width,  parameter  '6,’  of  the 
core-core  interaction  gaussians  [Eq.  (4)1,  is 
unconstrained,  then  during  the  initial  phase 
of  parameterization  these  '6’  values  could 
rise  to  unrealistic  values.  Inspection  of  the 
optimized  gaussian  invariably  revealed  that 
it  was  centered  at  a  distance  corresponding 
to  a  specific  bond  length  in  a  specific  com¬ 
pound.  When  that  gaussian  was  removed, 
only  one  compound  was  affected,  and  the 
error  in  the  calculated  heat  of  formation  of 
that  compound  changed  from  an  initial 
value  of  zero  to  t3q)ically  a  few  tens  of  kcal/ 
mole.  To  prevent  this,  the  'b’  values  were 
constrained  to  the  range  2. 0-6.0  A. 

For  iodine,  two  completely  stable  sets  of 
parameters  were  found.  No  path  was  found 
by  which  one  set  of  parameters  could  be  con¬ 
verted  into  the  other  set  without  passing 
through  a  region  of  very  high  S.  One  set  cor¬ 
responded  to  a  large  and  small  ^p,  while 
the  other  corresponded  to  small  and  large 
^p.  Since  the  two  sets  were  separated  by  a 
barrier,  and  since  a  large  would  be  ex¬ 
pected  for  iodine,  the  first  set  of  parameters 
is  likely  to  be  the  more  correct. 

No  complications  were  encountered  with 
parameter  sets  for  the  other  elements.  The 
final  values  for  all  parameter  sets  are 
presented  in  Table  II. 

TECHNICAL  DETAILS  FOR  USING 
MNDO-PM3  WITH  EXISTING  PROGRAMS 

In  order  for  calculations  using  MNDO-PM3 
to  be  carried  out  using  earlier  copies  of 
MOPAC  (Versions  3.1  or  higher)  or  AMPAC, 
the  key  words  EXTERNAL  and  AMI  must 
be  used.  MNDO-PM3  parameters  for  an  ele¬ 
ment  have  two  gaussians,  therefore  it  would 
be  necessary  to  set  gaussians  3  and  4  to  zero. 
The  parameter  names  for  the  parameters 
given  in  Table  II  are,  in  order,  USS,  UPP, 
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ZSS,  ZPP,  BETAS,  BETAP,  GSS,  GSP,  GPP, 
GP2,  HSP,  ALP,  FNll,  FN21,  FN31,  FN12, 
FN22,  FN32,  and,  in  order  to  eliminate  the 
third  and  fourth  gaussian  of  AMI,  FN13, 
FN14.  The  end  of  the  oxygen  EXTERNAL 
parameter  set  would  thus  be 

FN22  0  5.950512 

FN32  0  1.598395 

FN13  0  0.000000 

FN14  0  0.000000 

To  verify  that  the  parameters  have  been 
correctly  entered,  calculations  on  represen¬ 
tative  systems  should  be  done. 

CONCLUSION 

The  new  method  for  optimizing  parame¬ 
ters  for  semiempirical  quantum-mechanical 
methods  is  several  orders  of  magnitude  faster 
than  those  previously  employed.  Highly  op¬ 
timized  values  for  parameters  for  12  ele¬ 
ments  have  been  obtained  which  are  suitable 
for  use  in  the  study  of  chemical  systems.  Al¬ 
though  there  is  no  unambiguous  way  of 
determining  if  the  current  parameter  set 
corresponds  to  the  global  minimum,  exten¬ 
sive  checking  of  the  applicability  of  the  pa¬ 
rameters  to  a  wide  range  of  compounds  (fol¬ 
lowing  article)  does  not  suggest  that  any 
significantly  lower  minimum  exists. 

The  presence  of  several  almost  zero  eigen¬ 
values  in  the  parameter  Hessian  matrix 
indicates  that  the  parameter  set  is  not  unam¬ 
biguously  defined,  and  that  caution  must  be 
exercised  in  applying  these  parameters  to 
the  study  of  new  systems.  A  welcome  corol¬ 
lary  of  this  is  that  if  any  deficiencies  are 
identified,  and  that  the  reference  functions 
representing  these  deficiencies  can  increase 
the  value  of  any  of  these  almost  zero  eigen¬ 
values,  indicating  that  the  position  in  pa¬ 
rameter  space  of  the  associated  eigenvector 


has  become  defined,  then  the  parameter  set 
can  likely  be  reoptimized  so  as  to  remove 
these  deficiencies  without  compromising  the 
applicability  of  the  parameters  to  known 
systems. 
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